Search CORE

D-Scholarship@Pitt

To bind or not to bind - FoxA1 determines estrogen receptor action in breast cancer progression

Author: Benos PV
Oesterreich S
Watters RJ
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 19/06/2012
Field of study

Chromatin immunoprecipitation followed by massively parallel sequencing (ChIP-seq) is rapidly enabling the comprehensive characterization of genome-wide transcription factor-binding sites, thus defining the cistrome (cis-acting DNA targets of a trans-acting factor). Estrogen receptor (ER) ChIP-seq studies have been performed mainly in cell lines, but Ross-Innes and colleagues have now completed the first such study in clinical breast cancer samples. The study aimed at determining the dynamics of ER binding and differences between more and less aggressive primary breast tumors and metastases. The authors found that ER bound to DNA in both aggressive and drug-resistant tumors but to different sites and with different affinities. Given previous findings from cell lines, FoxA1 appears to play a critical role in this reprogramming of ER binding. © 2012 BioMed Central Ltd

D-Scholarship@Pitt

Inferring Binding Energies from Selected Binding Sites

Author: A Sarai
AE Kel
C Tuerk
Christopher Workman
DA Gilchrist
David Granas
DS Fields
DSF Homsi
E Roulet
E Sharon
Gary D. Stormo
GD Stormo
GD Stormo
GD Stormo
GD Stormo
H Ji
HF Teh
HG Roider
J Linnell
J Liu
JB Kinney
JJ Moré
L van Oeffelen
M Djordjevic
M Djordjevic
MF Berger
ML Lee
MQ Zhang
O Berg
PH von Hippel
PV Benos
PV Benos
Q Zhou
R Staden
SJ Maerkl
TH Cormen
TK Blackwell
TK Man
U Gerland
V Mustonen
VH Nagaraj
WE Wright
X Liu
X Meng
Y Takeda
Yue Zhao
Publication venue: Public Library of Science
Publication date: 01/01/2009
Field of study

We employ a biophysical model that accounts for the non-linear relationship between binding energy and the statistics of selected binding sites. The model includes the chemical potential of the transcription factor, non-specific binding affinity of the protein for DNA, as well as sequence-specific parameters that may include non-independent contributions of bases to the interaction. We obtain maximum likelihood estimates for all of the parameters and compare the results to standard probabilistic methods of parameter estimation. On simulated data, where the true energy model is known and samples are generated with a variety of parameter values, we show that our method returns much more accurate estimates of the true parameters and much better predictions of the selected binding site distributions. We also introduce a new high-throughput SELEX (HT-SELEX) procedure to determine the binding specificity of a transcription factor in which the initial randomized library and the selected sites are sequenced with next generation methods that return hundreds of thousands of sites. We show that after a single round of selection our method can estimate binding parameters that give very good fits to the selected site distributions, much better than standard motif identification algorithms

Infoscience - École polytechnique fédérale de Lausanne

Digital Commons@Becker

Probing the Informational and Regulatory Plasticity of a Transcription Factor DNA–Binding Domain

Transcription factors have two functional constraints on their evolution: (1) their binding sites must have enough information to be distinguishable from all other sequences in the genome, and (2) they must bind these sites with an affinity that appropriately modulates the rate of transcription. Since both are determined by the biophysical properties of the DNA–binding domain, selection on one will ultimately affect the other. We were interested in understanding how plastic the informational and regulatory properties of a transcription factor are and how transcription factors evolve to balance these constraints. To study this, we developed an in vivo selection system in Escherichia coli to identify variants of the helix-turn-helix transcription factor MarA that bind different sets of binding sites with varying degrees of degeneracy. Unlike previous in vitro methods used to identify novel DNA binders and to probe the plasticity of the binding domain, our selections were done within the context of the initiation complex, selecting for both specific binding within the genome and for a physiologically significant strength of interaction to maintain function of the factor. Using MITOMI, quantitative PCR, and a binding site fitness assay, we characterized the binding, function, and fitness of some of these variants. We observed that a large range of binding preferences, information contents, and activities could be accessed with a few mutations, suggesting that transcriptional regulatory networks are highly adaptable and expandable

FigShare

Expression of Regulatory Platelet MicroRNAs in Patients with Sickle Cell Disease

Author: A Abdollahi
A Dixon-McIver
A Osman
A Schedel
A Smolenski
A Tomer
AA Kondkar
AD Blann
AI Badeaux
AJ Saldanha
AM Healy
BM Steele
Claudia Coronnello
D Betel
D Betel
DA Hosack
DP Bartel
DV Gnatenko
EM Novelli
ER Popescu
G Bazzoni
Gregory J. Kato
Guoying Yu
H Bruchova
H Seitz
HA Brittain
I Hers
I Toma
J Villagra
Jen-Tsan Ashley Chi
JF Noronha
JS Mohan
K Joop
K Kaushansky
K Konishi
Kimberly Woodhouse
LL Horstman
LR Queen
M Kannan
M Kertesz
Maria G. Kapetanaki
Mark T. Gladwin
MC Ammons
ML Freedman
ML Jison
MP Hunter
N Chavda
N Raghavachari
Naftali Kaminski
Nalini Raghavachari
P Landry
Panayiotis V. Benos
PV Browne
R Antonucci
R Garzon
R Ross
R Visone
RC Friedman
RJ Berckmans
RL Nachman
RT Schermuly
S Amisten
S Kim
S Masaki
S Mendjan
S Nagalla
Shilpa Jain
SP Lee
Suchitra Barge
T Wun
V Ambros
W Huang da
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 12/04/2013
Field of study

Background: Increased platelet activation in sickle cell disease (SCD) contributes to a state of hypercoagulability and confers a risk of thromboembolic complications. The role for post-transcriptional regulation of the platelet transcriptome by microRNAs (miRNAs) in SCD has not been previously explored. This is the first study to determine whether platelets from SCD exhibit an altered miRNA expression profile. Methods and Findings: We analyzed the expression of miRNAs isolated from platelets from a primary cohort (SCD = 19, controls = 10) and a validation cohort (SCD = 7, controls = 7) by hybridizing to the Agilent miRNA microarrays. A dramatic difference in miRNA expression profiles between patients and controls was noted in both cohorts separately. A total of 40 differentially expressed platelet miRNAs were identified as common in both cohorts (p-value 0.05, fold change>2) with 24 miRNAs downregulated. Interestingly, 14 of the 24 downregulated miRNAs were members of three families - miR-329, miR-376 and miR-154 - which localized to the epigenetically regulated, maternally imprinted chromosome 14q32 region. We validated the downregulated miRNAs, miR-376a and miR-409-3p, and an upregulated miR-1225-3p using qRT-PCR. Over-expression of the miR-1225-3p in the Meg01 cells was followed by mRNA expression profiling to identify mRNA targets. This resulted in significant transcriptional repression of 1605 transcripts. A combinatorial approach using Meg01 mRNA expression profiles following miR-1225-3p overexpression, a computational prediction analysis of miRNA target sequences and a previously published set of differentially expressed platelet transcripts from SCD patients, identified three novel platelet mRNA targets: PBXIP1, PLAGL2 and PHF20L1. Conclusions: We have identified significant differences in functionally active platelet miRNAs in patients with SCD as compared to controls. These data provide an important inventory of differentially expressed miRNAs in SCD patients and an experimental framework for future studies of miRNAs as regulators of biological pathways in platelets. © 2013 Jain et al

D-Scholarship@Pitt

Effects of Ploidy and Recombination on Evolution of Robustness in a Model of the Segment Polarity Network

Author: A Bergman
A Gardner
A Inga
A Wagner
A Wagner
AL Hughes
AO Wilkie
AS Kondrashov
B Lemos
BA Edgar
C Zeyl
CD Meiklejohn
CH Waddington
CI Castillo-Davis
CR Haag
CR Landry
D Denver
D Wheeler
DA Thompson
DJ Tomso
E Meir
E Meir
G Conant
G Gibson
G Gibson
G Jimenez-Sanchez
G von Dassow
G von Dassow
H Kacser
J Kidd
JA de Visser
JB Anderson
JBS Haldane
KA Hughes
Kerry J. Kim
KJ Kim
KJ Kim
Lauren Ancel Meyers
LW Ancel
M Félix
M Kimura
M Lynch
MA Nowak
MA Savageau
ML Siegal
ML Siegal
MV Rockman
N Phadnis
N Tokuriki
NG Rahim
NT Ingolia
PJ Wittkopp
PV Benos
PV Benos
RA Fisher
RA Veitia
RB Azevedo
S Ciliberti
S Clodong
S Elena
S Rifkin
S Wright
SP Otto
SP Otto
SP Otto
SR Proulx
Vilaiwan M. Fernandes
W Ma
Publication venue: Public Library of Science
Publication date: 01/01/2008
Field of study

Many genetic networks are astonishingly robust to quantitative variation, allowing these networks to continue functioning in the face of mutation and environmental perturbation. However, the evolution of such robustness remains poorly understood for real genetic networks. Here we explore whether and how ploidy and recombination affect the evolution of robustness in a detailed computational model of the segment polarity network. We introduce a novel computational method that predicts the quantitative values of biochemical parameters from bit sequences representing genotype, allowing our model to bridge genotype to phenotype. Using this, we simulate 2,000 generations of evolution in a population of individuals under stabilizing and truncation selection, selecting for individuals that could sharpen the initial pattern of engrailed and wingless expression. Robustness was measured by simulating a mutation in the network and measuring the effect on the engrailed and wingless patterns; higher robustness corresponded to insensitivity of this pattern to perturbation. We compared robustness in diploid and haploid populations, with either asexual or sexual reproduction. In all cases, robustness increased, and the greatest increase was in diploid sexual populations; diploidy and sex synergized to evolve greater robustness than either acting alone. Diploidy conferred increased robustness by allowing most deleterious mutations to be rescued by a working allele. Sex (recombination) conferred a robustness advantage through “survival of the compatible”: those alleles that can work with a wide variety of genetically diverse partners persist, and this selects for robust alleles

Optimized mixed Markov models for motif identification

Author: AE Kel
B Matthews
B Negre
C Burge
D Cai
David M Umbach
E Roulet
E Wingender
G Schwarz
G Yeo
GA Wray
GD Stormo
GE Crooks
H Akaike
I Carmel
J Rissanen
JP Staley
K Ellrott
K Nandabalan
K Nelson
K Quandt
Leping Li
M Kellis
MG Reese
ML Bulyk
MP Ponomarenko
MQ Zhang
N Saitou
P Agarwal
P Bühlmann
PV Benos
Q Zhou
R Staden
RP Ketterling
S Salzberg
T Thanaraj
TD Schneider
TK Man
U Ohler
Uwe Ohler
W Krivan
Weichun Huang
X Xie
X Zhao
Y Barash
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

BACKGROUND: Identifying functional elements, such as transcriptional factor binding sites, is a fundamental step in reconstructing gene regulatory networks and remains a challenging issue, largely due to limited availability of training samples. RESULTS: We introduce a novel and flexible model, the Optimized Mixture Markov model (OMiMa), and related methods to allow adjustment of model complexity for different motifs. In comparison with other leading methods, OMiMa can incorporate more than the NNSplice's pairwise dependencies; OMiMa avoids model over-fitting better than the Permuted Variable Length Markov Model (PVLMM); and OMiMa requires smaller training samples than the Maximum Entropy Model (MEM). Testing on both simulated and actual data (regulatory cis-elements and splice sites), we found OMiMa's performance superior to the other leading methods in terms of prediction accuracy, required size of training data or computational time. Our OMiMa system, to our knowledge, is the only motif finding tool that incorporates automatic selection of the best model. OMiMa is freely available at [1]. CONCLUSION: Our optimized mixture of Markov models represents an alternative to the existing methods for modeling dependent structures within a biological motif. Our model is conceptually simple and effective, and can improve prediction accuracy and/or computational speed over other leading methods

Springer - Publisher Connector

MDC Repository

A Linear Model for Transcription Factor Binding Affinity Prediction in Protein Binding Microarrays

Author: A Beyer
A Sandelin
A Seth
A Tanay
AA Philippakis
B Foat
B Ren
CE Lawrence
CO Pabo
DM Rocke
DS Johnson
DS Latchman
E Segal
E Wingender
FG Falkner
G Stolovitzky
GD Stormo
H Lähdesmäki
HA Ingraham
Harri Lähdesmäki
J Mintseris
J Van Helden
JE Darnell
Kirsti Laurila
M Barkett
M Kasowski
M Nykter
Mark Isalan
Matti Annala
Matti Nykter
MF Berger
MF Berger
MJ Solomon
ML Bulyk
ML Bulyk
OG Berg
P Agius
PV Benos
R Tibshirani
S Gupta
S Mukherjee
TL Bailey
V Litvak
V Orlando
X Chen
X Liu
XS Liu
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Protein binding microarrays (PBM) are a high throughput technology used to characterize protein-DNA binding. The arrays measure a protein's affinity toward thousands of double-stranded DNA sequences at once, producing a comprehensive binding specificity catalog. We present a linear model for predicting the binding affinity of a protein toward DNA sequences based on PBM data. Our model represents the measured intensity of an individual probe as a sum of the binding affinity contributions of the probe's subsequences. These subsequences characterize a DNA binding motif and can be used to predict the intensity of protein binding against arbitrary DNA sequences. Our method was the best performer in the Dialogue for Reverse Engineering Assessments and Methods 5 (DREAM5) transcription factor/DNA motif recognition challenge. For the DREAM5 bonus challenge, we also developed an approach for the identification of transcription factors based on their PBM binding profiles. Our approach for TF identification achieved the best performance in the bonus challenge

Aaltodoc Publication Archive

CSMET: Comparative Genomic Motif Detection via Multi-Resolution Phylogenetic Shadowing

Author: A Sandelin
A Siepel
AC Siepel
AC Siepel
AM Moses
AM Moses
AM Moses
BE Engelhardt
C Bergman
C Boutilier
CM Bergman
D Boffelli
DA Papatsenko
EH Margulies
EP Xing
EP Xing
Eric P. Xing
GE Crooks
GJ Olsen
I Dubchak
J Felsenstein
J Felsenstein
J Felsenstein
J Pedersen
JD McAuliffe
M Blanchette
M Blanchette
M Blanchette
M Hasegawa
M Tompa
MC Frith
Mladen Kolar
MR Kantorovitz
MZ Ludwig
MZ Ludwig
MZ Ludwig
Pradipta Ray
PV Benos
R Siddharthan
RG Cowell
S Sinha
S Sinha
SB Montgomery
Suyash Shringarpure
T Wang
TH Jukes
Uwe Ohler
W Huang
Publication venue: Public Library of Science
Publication date: 01/06/2008
Field of study

Functional turnover of transcription factor binding sites (TFBSs), such as whole-motif loss or gain, are common events during genome evolution. Conventional probabilistic phylogenetic shadowing methods model the evolution of genomes only at nucleotide level, and lack the ability to capture the evolutionary dynamics of functional turnover of aligned sequence entities. As a result, comparative genomic search of non-conserved motifs across evolutionarily related taxa remains a difficult challenge, especially in higher eukaryotes, where the cis-regulatory regions containing motifs can be long and divergent; existing methods rely heavily on specialized pattern-driven heuristic search or sampling algorithms, which can be difficult to generalize and hard to interpret based on phylogenetic principles. We propose a new method: Conditional Shadowing via Multi-resolution Evolutionary Trees, or CSMET, which uses a context-dependent probabilistic graphical model that allows aligned sites from different taxa in a multiple alignment to be modeled by either a background or an appropriate motif phylogeny conditioning on the functional specifications of each taxon. The functional specifications themselves are the output of a phylogeny which models the evolution not of individual nucleotides, but of the overall functionality (e.g., functional retention or loss) of the aligned sequence segments over lineages. Combining this method with a hidden Markov model that autocorrelates evolutionary rates on successive sites in the genome, CSMET offers a principled way to take into consideration lineage-specific evolution of TFBSs during motif detection, and a readily computable analytical form of the posterior distribution of motifs under TFBS turnover. On both simulated and real Drosophila cis-regulatory modules, CSMET outperforms other state-of-the-art comparative genomic motif finders

Widespread Site-Dependent Buffering of Human Regulatory Polymorphism

The average individual is expected to harbor thousands of variants within non-coding genomic regions involved in gene regulation. However, it is currently not possible to interpret reliably the functional consequences of genetic variation within any given transcription factor recognition sequence. To address this, we comprehensively analyzed heritable genome-wide binding patterns of a major sequence-specific regulator (CTCF) in relation to genetic variability in binding site sequences across a multi-generational pedigree. We localized and quantified CTCF occupancy by ChIP-seq in 12 related and unrelated individuals spanning three generations, followed by comprehensive targeted resequencing of the entire CTCF–binding landscape across all individuals. We identified hundreds of variants with reproducible quantitative effects on CTCF occupancy (both positive and negative). While these effects paralleled protein–DNA recognition energetics when averaged, they were extensively buffered by striking local context dependencies. In the significant majority of cases buffering was complete, resulting in silent variants spanning every position within the DNA recognition interface irrespective of level of binding energy or evolutionary constraint. The prevalence of complex partial or complete buffering effects severely constrained the ability to predict reliably the impact of variation within any given binding site instance. Surprisingly, 40% of variants that increased CTCF occupancy occurred at positions of human–chimp divergence, challenging the expectation that the vast majority of functional regulatory variants should be deleterious. Our results suggest that, even in the presence of “perfect” genetic information afforded by resequencing and parallel studies in multiple related individuals, genomic site-specific prediction of the consequences of individual variation in regulatory DNA will require systematic coupling with empirical functional genomic measurements